Multi-objective Bandits: Optimizing the Generalized Gini Index

Authors

  • Róbert Busa-Fekete
  • Balázs Szörényi
  • Paul Weng
  • Shie Mannor
Abstract

We study the multi-armed bandit (MAB) problem in which the agent receives vector-valued feedback that encodes several, possibly competing, objectives to be optimized. The goal of the agent is to find a policy that optimizes these objectives simultaneously in a fair way. This multi-objective online optimization problem is formalized using the Generalized Gini Index (GGI) aggregation function. We propose an online gradient descent algorithm that exploits the convexity of the GGI aggregation function and carefully controls exploration, achieving a distribution-free regret of Õ(T^{1/2}) with high probability. We test our algorithm on synthetic data as well as on an electric battery control problem, where the goal is to trade off the use of the different cells of a battery in order to balance their respective degradation rates.
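
To make the aggregation function concrete, here is a minimal Python sketch of a GGI computed over a cost vector. It is not the authors' algorithm. It assumes the usual cost-based convention (components sorted in decreasing order and paired with non-increasing weights, so the worst objective counts most), and the names ggi and ggi_subgradient, as well as the weights 1, 1/2, 1/4, are illustrative choices that do not come from the abstract.

```python
import numpy as np


def ggi(x, w):
    """Generalized Gini Index of a cost vector x (assumed convention).

    The largest cost is paired with the largest weight, so w must be
    non-increasing. Under this convention the function is convex in x.
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    if x.shape != w.shape or x.ndim != 1:
        raise ValueError("x and w must be 1-D arrays of equal length")
    if np.any(np.diff(w) > 0):
        raise ValueError("weights must be non-increasing")
    x_desc = np.sort(x)[::-1]  # costs in decreasing order
    return float(np.dot(w, x_desc))


def ggi_subgradient(x, w):
    """One subgradient of ggi at x: each component gets the weight of its rank."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    order = np.argsort(-x, kind="stable")  # indices from largest to smallest cost
    g = np.empty_like(w)
    g[order] = w
    return g


# Illustrative weights (hypothetical, not taken from the abstract).
weights = np.array([1.0, 0.5, 0.25])
unbalanced = ggi([1.0, 3.0, 2.0], weights)  # 3*1 + 2*0.5 + 1*0.25
balanced = ggi([2.0, 2.0, 2.0], weights)    # 2 * (1 + 0.5 + 0.25)
```

Both cost vectors sum to 6, yet the balanced one has the lower GGI value, which is the sense in which minimizing the index favors fair trade-offs. The subgradient is the kind of quantity a gradient-based method could use, although the abstract does not spell out the update rule.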

Similar resources

A Generalized Gittins Index for a Class of Multiarmed Bandits with General Resource Requirements

We generalise classical multi-armed and restless bandits to allow for the distribution of a (fixed amount of a) divisible resource among the constituent bandits at each decision point. Bandit activation consumes amounts of the available resource which may vary by bandit and state. Any collection of bandits may be activated at any decision epoch provided they do not consume more resource than is...

A Note on Bandits with a Twist

A variant of the multi-armed bandit problem was recently introduced by Dimitriu, Tetali and Winkler. For this model (and a mild generalization) we propose faster algorithms to compute the Gittins index. The indexability of such models follows from earlier work of Nash on generalized bandits.

Estimating Quality in User-Guided Multi-Objective Bandits Optimization

Many real-world applications are characterized by a number of conflicting performance measures. As optimizing in a multi-objective setting leads to a set of non-dominated solutions, a preference function is required for selecting the solution with the appropriate trade-off between the objectives. This preference function is often unknown, especially when it comes from an expert human user. Howe...

The Generalized Gini index and the measurement of income mobility

Two new normative indices of mobility are proposed. The first one is a population weighted generalized Gini mobility index and will be higher, the higher the size of the transfer between two individuals and, for a given transfer, the higher the rank difference between the individuals between whom the transfer takes place. This index is also higher, the greater the rank gap between the individua...

Generalized Gini Inequality Indices

When incomes are ranked in descending order the social-evaluation function corresponding to the Gini relative inequality index can be written as a linear function with the weights being the odd numbers in increasing order. We generalize this function by allowing the weights to be an arbitrary non-decreasing sequence of numbers. This results in a class of generalized Gini relative inequality ind...

Publication date: 2017